Assumed demographics, predicted behavior, and targeted incentives

ABSTRACT

A system and method for anticipating consumer behavior and determining transaction incentives for influencing consumer behavior comprises a computer system and associated database for determining cross time correlations between transaction behavior, for applying the function derived from the correlations to consumer records to predict future consumer behavior, and for deciding on transaction incentives to offer the consumers based upon their predicted behavior.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 12/870,764, filed Aug. 27, 2010, which is a division of U.S. patent application Ser. No. 10/559,937, filed Dec. 8, 2005, which was a National Stage entry of PCT/US04/26736, filed Sep. 13, 2004, which claims priority to U.S. provisional application Ser. No. 60/503,877, filed Sep. 22, 2003, and the entire contents of each of these applications are incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates to the field of marketing. More particularly, the invention relates to data mining and targeted marketing.

2. Discussion of the Background

Marketing means communicating information regarding at least one of products and services to consumers.

Targeted marketing means selectively marketing to a limited number of consumers, such as an individual, members of a family, persons with the same residence as one another, or persons having some other piece of relevant information in common with one another. U.S. Pat. No. 5,832,457 to O'Brien discloses targeted marketing.

Value means a good, a service, or a pecuniary interest including cash, check, credit, and conditional credit.

Transaction, in this application, means an exchange involving at least two parties. A purchase is a transaction. Receipt of an incentive offer (such as the act of downloading an incentive offer from a web site), redemption of an incentive offer, and acceptance of participation in a consumer survey are transactions.

Purchase, in this application, means a transaction in which cash, check, charge, or credit is exchanged for one or more goods and services.

Transaction incentive, in this application, means an offer for a certain value in exchange for entering into a specified transaction.

Purchase incentive, in this application, means an incentive wherein the specified transaction for which the incentive is offered is a purchase.

Incentive, in this application, means the certain value associated with entering into a transaction specified in a transaction incentive.

Incentive terms and transaction incentive terms, in this application, each means the specifications of the transaction into which someone must enter to be entitled to the incentive associated with the transaction incentive.

Predictive modeling is disclosed in U.S. Pat. No. 4,961,089 to Jamzadeh. Predictive modeling in this application means using data corresponding to prior events to determine a probability of occurrence of an event during some time period in the future. An example of a predictive model would be a formula whose inputs are the number of sun spots recorded over each of the last several years and whose output was a probability of a number of sunspots in the next year falling within a specified range.

A prediction, in this application, means an assumption that one or more events, such as a transaction or a purchase, either are probable to occur or are not probable to occur.

A class, in this application, means a set of things that share a common attribute. An example of a class is the class of all things that are goods. Goods are items of merchandise for sale. Another example of a class is the class of goods that are packaged goods. Another example of a class is the class of packaged goods that are canned goods. Another example of a class is the class of all products that are sold under the same brand name or trademark. Another example of a class is the class of all products that are manufactured by the same manufacturer. Another example of a class is the class of all goods that are diet related. Another example of a class is the class of all goods that are prescription drugs. Another example of a class is the class of all products that are breakfast products. Another example of a class is the class of all service reservations that are rooming reservations, such as reservations at hotels and motels. Another example of a class is the class of all services reservations that are transportation reservations, including car, bus, train, and airline reservations. A class may consist of items of a single product, such as items having the same Universal Product Code (“UPC”). For example, all product items having the UPC code associated with 12 ounce glass bottles of Coca Cola define a class.

A database is a collection of data. Typically a database is organized in some fashion so that information contained in the data may be readily accessed. In this application the term database means data organized in some format in a computer memory that can be accessed by an associated computer system. Such a concept is also referred to as a database management system. A database or database management system includes commercial database products implemented in a computer system, such as the Microsoft Access and SQL Server line of products as well as any set of files stored in computer memory that can be accessed by an associated computer system.

Designated, as used in the foregoing example, refers to an association of the data defining the good or service with data defining an attribute. All data defining products and services associated with the same attribute are deemed to be in the same class defined by that attribute.

Data defining a good or a service may be stored in a database. Such data may be designated as a member of a class.

Transaction data, in this application, means data defining a transaction. Transaction data includes price of transaction, description of items purchased, price of items purchased, quantity of each type of item purchased, date of transaction, time of transaction, location of transaction, identity of purchaser, identify of seller, tender type of transaction, etc. Transaction data does not include description of purchaser beyond identity.

Non-transaction demographic data, in this application, means characteristics of people that are not part of transaction data. For example, non-transaction demographics data includes age bracket, gender, race, ethnicity, pet ownership. In addition, non-transaction demographics data includes income range, type of job or profession, education level, marital status, existence of children in an age bracket, and hobbies.

Block data, in this application, means data specifying averages, medians, or other statistical measures, by detailed product category and demographic, for a group of people living in a geographic region of a size on the order of a city block or subdivision. Typically, the dimension of a geographic region associated with a block, which is sometimes referred to as a block region, is between about 100 meters and about 1000 meters.

Survey demographic data is data provided by people in response to questions posed to them. Survey demographic data is not transaction data, as transaction data is defined herein.

Survey derived demographic data is data measuring a demographic of a person that is derived from the person's response to questions posed to the person.

Demographic data is often survey or survey derived demographic data, which is data obtained by peoples responses to questions posed to them. For example, people may be associated with a demographic named ‘Active Nutritionalists’ in response to either response to a question whether they are actively practice good nutritional habits or responses to a series of questions relating to the type of foods they purchase indicating that they have healthy diets.

Mixed derived demographic data, in this application, means demographic data derived from both non-transaction demographic data and transaction data. For example, mixed derived demographic data may be a value of a demographic variable inferred for a customer based upon the customer identifying ‘Active Nutritionalist’ in a survey question and regularly purchasing milk. For example, a mixed derived demographic value for this example could be “mother practicing good nutritional habits” since regular purchase of milk may indicate mother of small children demographic.

“Demographic” herein often is used as a noun, referring to a value of demographic data.

In this application, assumed means predicted.

In this application, dis-associated with means not associated with.

In this application, assumed data means a prediction that a property associated with the value of the data accurately characterizes a thing, such as a person or people associated with a CID (consumer identification).

In this application, assumed demographic data associated with a person or a CID means a prediction that the corresponding demographic accurately characterizes the person or people associated with the CID.

In this application, assumed non-transaction demographic associated with a person or a CID means a prediction that the corresponding non-transaction demographic accurately characterizes the person or people associated with the CID.

DISCLOSURE OF THE INVENTION Objects of the Invention

It is an object of the invention to provide to a consumer targeted incentives based upon a prediction whether the consumer will enter into a transaction.

Generally speaking, it is an object of the invention to identify transaction patterns of a set of consumers in an earlier time period that correlate to specified purchases by those consumers in later time periods.

More specifically, it is an object of the invention to identify transaction patterns of a set of consumers in one or more earlier first time periods that correlate to purchases by those consumers of one or more products and services within specified product and service classes in one or more later second time periods after the one or more earlier first time periods.

Generally speaking, it is an object of the invention to identify transaction patterns of consumers in a first time period that correlate to changes in rate or amount of purchases within a specific product or service class between the earlier first time period and the later second time period.

More specifically, it is an object of the invention to identify transaction patterns of a set of consumers in one or more earlier first time periods that correlate to changes in purchases by those consumers of one or more products and services within specified product and service classes between the one or more earlier first time periods and one or more later second time periods after the one or more earlier first time periods.

It is a further object of the invention to base a decision whether to offer a consumer a transaction incentive, such as a purchase incentive for a specified purchase within a certain product or service class, upon the existence of the foregoing transaction patterns in the transaction history records of the consumer that correlate to that certain product or service class.

It is a still further object of this invention to determine from the foregoing correlations a probability or a prediction that the specified consumer will purchase an item within a class of products or services.

It is still a further object of the invention to use the foregoing probability or prediction that the specified consumer will or will not enter into a purchase transaction for an item within a class of products or services as a factor in deciding what if any purchase incentives to offer to that specified consumer.

It is a still further object of this invention to determine from the foregoing correlations relative likelihoods, for each member of a set of consumers, that the consumer will purchase an item within a class of products or services, and to use that ranking in deciding which consumers to target market.

It is another object of this invention to perform targeted marketing by depending offering to a consumer a transaction incentive upon a probability that offering the incentive will change the consumer's purchase behavior for purchase of an item specified in the purchase incentive.

It is another object of this invention to perform targeted marketing by depending the value of a purchase incentive, the preconditions for the consumer to obtain the purchase incentive, or both, upon a quantitative determined probability or prediction of the consumer entering into a purchase transaction.

It is another object of the invention to perform targeted marketing so as maximize the expected return (either gross profit or net profit) to the seller or manufacturer of an item of a product or service based upon the cost of the incentive to the seller or manufacturer and a predicted probability of changing the consumer's purchase behavior.

It is also a general object of the invention to minimize the cost associated with targeted marketing campaigns by increasing the redemption rate of incentive offers.

It is another object of the invention to associate at least one demographic data with a Consumer Identification (“CID”) associated with a transaction history for a consumer, to thereby define an assumed non-transaction demographic data for the consumer.

It is another object of the invention to assign a probability that an assumed non-transaction demographic data for a consumer is correct.

It is another object of the invention to identify from a group of assumed non-transaction demographic data for a consumer, that non-transaction demographic data which has the highest probability of being correct.

It is another object of the invention to rank a group of assumed non-transaction demographic data for a consumer according to the probability of the assumed non-transaction demographic data being correct.

It is another object of the invention to disassociate at least one non-transaction demographic data with a CID associated with a transaction history for a consumer, to thereby define a dis-assumed non-transaction demographic data, that is, a non-transaction demographic data assumed to not be correct for a consumer.

It is another object of the invention to assign a probability that the dis-assumed non-transaction demographic data for the consumer is correct, and to rank a set of dis-assumed non-transaction demographic data for a consumer by their probability of being correct.

It is another object of the invention to determine whether to offer a specified purchase incentive to a consumer based upon at least in part upon one of assumed non-transaction demographic, mixed derived demographic, dis-assumed non-transaction demographic, most probable assumed non-transaction demographic, most probable dis-assumed non-transaction demographic, probability of assumed non-transaction demographic, probability of dis-assumed non-transaction demographic, ranking of assumed non-transaction demographic, or ranking of dis-assumed non-transaction demographic associated with that consumer.

It is another object of the invention to define a purchase incentive or select from alternative specified terms for purchase incentives a purchase incentive to offer to a consumer based upon at least in part upon one of assumed non-transaction demographic, dis-assumed non-transaction demographic, most probable assumed non-transaction demographic, most probable dis-assumed non-transaction demographic, probability of assumed non-transaction demographic, probability of dis-assumed non-transaction demographic, ranking of assumed non-transaction demographic, or ranking of dis-assumed non-transaction demographic associated with that consumer.

It is another object of the invention to define a demographic data assumption function that, when applied to a transaction history record for a consumer, will assume either that the demographic data is associated with the consumer or that the demographic data is not associated with the consumer.

It is another object of the invention to define a demographic data probability function that, when applied to a transaction history record for a consumer, will generate either a probability that the demographic data is associated with the consumer or that the demographic data is not associated with (dis-associated with) the consumer.

It is another object of the invention to determine from probability of each one of a set of demographics associated with a consumer and prior product incentive redemption history for that consumer which of a set of incentive offers the consumer is most likely to redeem.

It is another object of the invention to determine from probability of each one of a set of assumed non-transaction demographics associated with a consumer, a set of incentive offers for purchase of specified products available to be offered to the consumer, and probability of the consumer redeeming each one of the set of incentive offers, offering of which incentive is likely to result in the greatest profit.

It is another object of the invention to combine assumed non-transaction demographic data for consumers having transactions with a specified retail store location with block data for the same non-transaction demographic data, for at least one block region geographically close to the retail store location, to estimate the fraction of people having a specified demographic and living in the at least one specified block region that shop in the retail store.

It is another object of the invention to use the estimate the fraction of people having a specified demographic and living in the at least one specified block region that shop in the retail store location as a basis for deciding whether to target market consumers associated with that non-transaction demographic that live in that specified at least one block region.

SUMMARY OF THE INVENTION

These and other objects of the invention are provided by a novel computer system programmed or designed to analyze consumer specific data including historical consumer transaction data to identify statistical correlations (correlation data) between transactions in classes in different time periods.

The same or a different computer system may use the correlation data in combination with consumer transaction data for a consumer to rank or estimate probability of that consumer entering into a transaction in a specified class in a specified future time period.

The same or a different computer system may also be programmed to apply the rankings or probabilities to transaction incentive criteria to determine whether to offer a transaction incentive associated with the criteria to a consumer having the ranking or probability. The computer system may determine the mechanism for making the offer of a transaction incentive, the timing of making the offer of a transaction incentive, the value associated with the offer of a transaction incentive, the class or classes of transactions into which the consumer must enter in order to receive the incentive.

The method of the invention provides predictions of future transactions for a consumer that are based upon applying a specified consumer's transaction to one or more mathematical formula that define a likelihood of a consumer entering into one or more transactions, and wherein the mathematical formula incorporates correlation data obtained by statistical analysis of transaction data for a set of consumers over a time period.

The correlations resulting from consumer transaction data may be combined with correlations to consumer purchase preferences from other sources, such as correlations from demographic data and customer loyalty quotients. The correlations of a particular customer to these types of data may be incorporated into a ranking or probability function in predicting the likelihood of a consumer entering into a specified transaction in a specified future time period.

Some of the types of consumer behavior that may be predicted are changes in transaction behavior. For example the predictions may be that certain consumers will be purchasing more or less in a specified class, switching brands in the same product class, purchase larger or smaller values or quantities in a particular class, purchase a particular brand in a class when the consumer had not recently purchased in that class, or not purchase in a class the consumer previously purchased in (referred to herein as lapsing).

The particular class may be products and services associated with any one or more designations. Some examples of classes of goods are fresh meat, canned goods, baking goods, diet foods, breads, cereals, coffees, dairy products, alcoholic beverages, beers, wines, and liquors. More examples of classes are a specified brand of a type of product, such as Kleenex brand tissues, Welch's brand grape juice. More examples of classes are a brand, per se, such as Kelloggs, and Green Giant. A class also includes a group of brands, such as Keebler brands. Examples of classes include products, such as Green Giant eight ounce canned corn and Aunt Jemima one pound pancake mix. The particular class may be a group of products having a feature in common, such as lower fat products, organic products, or products of one or more brands known to be relatively low or relatively high priced.

The consumer transaction data may include data that are measures of any of the following: purchases of a consumer (including transaction class data and transaction date data), incentive redemptions by a consumer, reservations of a consumer, modality of offering transaction incentives that were redeemed by a consumer, frequency of transaction, dollar volume of transaction, item count of purchase, frequency of redemption, value of redeemable incentive, and relative amount of incentives redeemed in different classes. Preferably, all transaction data includes a date or a date range associated with each transaction.

The non-transactional data may include demographics, such as information involving family size, number and age of children, age, and household income bracket. The non-transactional data may also include personal preferences data provided by the consumer.

Customer loyalty quotient data is data indicating the fraction a customer's expenses in classes sold at a specified retail store or chain of retail stores that the customer actually purchases at that retail store or in the same chain of retail stores.

The method of the invention preferably uses transaction data from a plurality of distinct classes in determining correlations to a ranking or a probability of a consumer transaction in a specified class.

In certain instances, formula providing predictions of a consumer entering into a transaction in a specified class do not depend upon consumer transaction data from that class, but only from consumer transaction data for transactions from other distinct classes. This is particularly true when the formula is for a prediction that a consumer's purchases in a specified class will lapse or that the consumer will purchase in a class in which the consumer either has not previously purchased or has not purchased in a very long period of time, such as 2 months, 4 months, or 6 months.

The results of the ranking, probabilities, and predictions may be used as the basis of generating two generically different types of marketing. One type of marketing provides a transaction incentive that does not attempt to change the consumer's predicted transaction behavior. For example, a transaction incentive for purchasing in a class offered in response to a prediction that the consumer's purchasing in that class will lapse attempts to change predicted behavior. A second type of marketing does not provide an incentive to modify the consumer's predicted transaction behavior. For example, a transaction incentive for purchasing a brand in a class when there is a prediction that the consumer will purchase in that class does not attempt to change predicted behavior. In addition, a subgeneric type of marketing does not attempt to change the consumer's predicted purchase behavior as to a genus class but does provide an incentive for the consumer to change predicted transaction behavior in a species class of the genus. For example, a transaction incentive to purchase a second product brand in a specified class when there is a prediction that the consumer will purchase a first product brand in the specified class attempt to change predicted behavior as to the species class but not the genus.

Data as to probability of a consumer redeeming a transaction incentive depending upon the modality of the offer may be used to determine whether to provide a transaction offer derived by using the method of the invention via a point of sale (POS) coupon, email, postal mail, set top box, or personal web site, and data as to the correlated time period in which the prediction indicates consumer transaction behavior may be used to time the delivery of the transaction offer to the consumer.

In one aspect, the invention provides a system and method for anticipating consumer behavior and determining transaction incentives for influencing consumer behavior comprising a computer system and associated database for determining cross time correlations between transaction behavior, for applying a function derived from the correlations to consumer records to predict future consumer behavior, and for deciding which transaction incentives to offer the consumers based upon their predicted behavior.

In another aspect, the invention provides a system and method of its use comprising a database, a computer system having read and write access to said database; and wherein the database stores a plurality of consumer records including a first consumer record for a first consumer, wherein said first consumer record stores (1) CID data (consumer identification data) indicating a first consumer CID for said first consumer; in association with said first consumer CID, at least (2) transaction data in a first transaction class field indicating items transacted by said first consumer in a first transaction class during a first prior time period and (3) predictive data in a first predictive field indicating at least one of a ranking, a probability, and a prediction that said first consumer will transact in a first correlated class during a correlated time period, and wherein said correlated time period is subsequent in time to said prior time period. In additional aspects, said first class and said second class define the same class; said first class defines a different class than said second class; said first class is distinct from said second class; said first class is a genus and said second class is a species of said genus; said first class defines a species of a genus and said second class defines said genus; said database stores, in association with said first consumer CID, transaction data in a second transaction class field indicating items transacted by said consumer in a second transaction class during said first prior time period; said database stores, in association with said first consumer CID, transaction data in a third transaction class field indicating items transacted by said consumer in a third transaction class during said first prior time period; said database stores, in association with said first consumer CID, transaction data in a fourth transaction class field indicating items transacted by said consumer in a fourth transaction class during said first prior time period; said database stores, in association with said first consumer CID, transaction data in a fifth transaction class field indicating items transacted by said consumer in a fifth transaction class during said second prior time period; said database stores, in association with said first consumer CID, transaction data in a sixth transaction class field indicating items transacted by said consumer in a sixth transaction class during said second prior time period; said database stores, in association with said first consumer CID, transaction data in a seventh transaction class field indicating items transacted by said consumer in a seventh transaction class during said second prior time period; said database stores, in association with said first consumer CID, transaction data in an eighth transaction class field indicating items transacted by said consumer in an eighth transaction class during said second prior time period; the system is programmed to decide whether to offer a transaction incentive to said first consumer based upon said predictive data in said first predictive field for said first consumer; a term of said transaction incentive is purchase in said first correlated class; a term of said transaction incentive is purchase of a specified quantity in said first correlated class; a term of said transaction incentive is purchase of a specified brand in said first correlated class; wherein terms of said transaction incentive are purchase in said first correlated class during said correlated time period; wherein a term of said transaction incentive is purchase in a class other than said first correlated class; wherein a term of said transaction incentive is purchase of a specified brand not in said first correlated class; wherein terms of said transaction incentive are purchases in a class other than said first correlated class during said correlated time period; a terminal for presenting said transaction incentive to said first consumer; a printer for printing said terms of transaction incentive; said printer is at a POS terminal; said printer is at a Kiosk; said printer is at a consumer's computer; deciding whether to offer a transaction incentive to said first consumer based upon said predictive data in said first predictive field for said first consumer, printing said transaction incentive; and printing in the presence of said consumer.

In another aspect, the invention comprises a system and method for, in a set of customer records containing transaction data, correlating transactions in a first set of input classes in a first time period to transactions in a first correlated class in a second time period, wherein the second time period is subsequent to said first time period, thereby defining correlation data for said first set of classes; and deciding whether to issue a transaction incentive to a customer based at least in part upon transaction data for said customer in said first set of classes and said correlation data for said first set of classes. Additional aspects of the invention comprise correlating transactions in a second set of classes in said first time period to transactions in a second correlated class in said second time period, thereby defining correlation data for said second set of classes; and deciding whether to issue a transaction incentive to a customer based at least in part upon (1) transaction data for said customer in said first set of classes and said correlation data for said first set of classes and (2) transaction data for said customer in said second set of classes and said correlation data for said second set of classes; correlating transactions in a second set of classes in a second time period to transactions in a second correlated class in said second time period, thereby defining correlation data for said second set of classes; and deciding whether to issue a transaction incentive to a customer based at least in part upon (1) transaction data for said customer in said first set of classes and said correlation data for said first set of classes and (2) transaction data for said customer in said second set of classes and said correlation data for said second set of classes; a term of said transaction incentive is purchasing in said first correlated class; a term of said transaction incentive is purchasing a specified brand in said second correlated class; said transaction incentive is a transaction incentive for transacting in that one of said first correlated class and said second correlated class having correlated data with higher values; said deciding comprising determining whether said transaction data for said customer contains data indicating said customer transacted in at least one class of said first set having a relatively high correlation values compared to other classes of said first set; said at least one class comprises at least two classes; said at least one class comprises at least three classes; said at least one class comprises at least four classes; said deciding occurs while said customer is participating in a transaction at a POS terminal; offering said transaction incentive to said customer while said customer is participating in said transaction and at said POS terminal; said deciding occurs between transactions of said customer; mailing or e-mailing said transaction incentive to said customer; offering said transaction incentive to said customer when said customer is identified at a terminal; said terminal is a POS terminal; said terminal is a kiosk; said deciding whether to issue a transaction incentive is based at least in part upon non transaction data for said consumer; said non transaction data comprises demographic data; said non transaction data comprises loyalty quotient data; and said deciding whether to issue a transaction incentive is based at least in part upon block data. CID block data is statistical data associated with customers in a local neighborhood region.

In yet another aspect, the invention provides a system and method comprising means for, in a set of customer records containing transaction data, correlating transactions in a first set of input classes in a first time period to transactions in a first correlated class in a second time period, wherein the second time period is subsequent to said first time period, thereby defining correlation data for said first set of classes; and means for deciding whether to issue a transaction incentive to a customer based at least in part upon transaction data for said customer in said first set of classes and said correlation data for said first set of classes.

In yet another aspect, the invention provides a system and method comprising means for, in a set of customer records containing transaction data, correlating transactions in a first set of input classes in a first time period to a change in transactions in a first correlated class between said first time period and a second time period, wherein the second time period is subsequent to said first time period, thereby defining correlation data for said first set of classes; and deciding whether to issue a transaction incentive to a customer based at least in part upon transaction data for said customer in said first set of classes and said correlation data for said first set of classes. Additional aspects of the invention include correlating transactions in a second set of classes in said first time period to a change in transactions in a second correlated class between said first time period and said second time period, thereby defining correlation data for said second set of classes; and deciding whether to issue a transaction incentive to a customer based at least in part upon (1) transaction data for said customer in said first set of classes and said correlation data for said first set of classes and (2) transaction data for said customer in said second set of classes and said correlation data for said second set of classes.

In yet another aspect, the invention provides a system and method comprising means for (1) anticipating a consumer's behavior for purchasing a first product based at least in part upon at least a portion of that part of said consumer's transaction history including said consumer's transactions for products other than for purchase of said first product; and (2) basing an incentive determination upon said anticipating.

In yet another aspect, the invention provides a system and method comprising means for (1) anticipating a consumer's behavior for purchasing a first product based at least in part upon at least a portion of that part of said consumer's transaction history including said consumer's transactions for purchases of at least one product other than said first product; and (2) basing an incentive determination upon said anticipating. Additional aspects of the invention comprise that said incentive determination is based at least in part upon differences between said consumer's anticipated and prior purchase behavior for said product; said differences are based at least in part upon variations in frequency of purchase; said differences are based at least in part upon variations in quantity of purchase; said differences are based at least in part upon variations in utilization of prior incentive for purchase; said differences are based at least in part upon variations in brand loyalty; said anticipating is based at least in part upon analysis of data in consumer records; and said anticipating is based at least in part upon a model of consumer transaction behavior.

In yet another aspect, the invention provides a system and method comprising [0082] receiving transaction data at a POS terminal in a retailer computer system; transmitting said transaction data over a communications network to a central computer system; storing said transaction data in a database to which said central computer system has read and write access;

(2) analyzing consumer transaction data stored in said database; (3) determining transaction incentives to offer to consumers associated with said transaction data; wherein said central computer system is programmed to employ predictive modeling on said transaction data to determine a predictive modeling function, to determine at least one of rankings, probabilities, and predictions of future consumer transactions by applying consumer transaction data to said predictive modeling function, and determining transaction incentives based upon said at least one of rankings, probabilities, and predictions. Additional aspects of the invention comprise determining timing and method of transmission of said transaction incentives that will most likely result in said transaction incentives being redeemed; and transmitting said transaction incentive over said communication network to said retailer computer system.

In another aspect, the invention provides a system and method for associating at least one assumed demographic data with a Consumer Identification (“CID”) associated with a transaction history for a consumer, comprising: a database; a computer system having read and write access to said database; and wherein said database stores a plurality of consumer records including a first consumer record for a first consumer, wherein said first consumer record stores: (1) CID data (consumer identification data) indicating a first consumer CID for said first consumer;

in association with said first consumer CID, at least the following: (2) transaction data in a first transaction class field indicating items transacted by said first consumer in a first transaction class during a first prior time period; and (3) a first assumed non-transaction demographic data field for storing assumed non-transaction demographic data. Additional aspects of this invention include wherein said computer system comprises means or code for assuming a value for said first assumed non-transaction demographic data field of said first consumer record based upon data in other fields of said first consumer record; wherein said computer system comprises means or code for deriving a value for a derived data field of said first consumer record based upon data in other fields of said first consumer record; wherein said value for said assumed non-transaction demographic data field is a probability or an expectation; wherein said value for said derived data field is a probability or an expectation; means or code for identifying, from a group of assumed non-transaction demographic data fields in said first consumer record, that non-transaction demographic data field which has the highest probability of being correct; means or code for identifying, from a group including at least one assumed non-transaction demographic data field and one mixed derived demographic data field in said first consumer record, that non-transaction demographic data field which has the highest probability of being correct for the corresponding consumer; means or code for ranking a group of assumed non-transaction demographic data in said first consumer record according to the probability of the assumed non-transaction demographic data being correct for a consumer corresponding to said first consumer CID; and means or code for identifying, from a group including at least one assumed non-transaction demographic data field and one mixed derived demographic data field in said first consumer record, that non-transaction or mixed derived demographic data field which has the highest probability of being correct for the corresponding consumer; means or code for determining whether to offer to said first consumer an incentive based upon data stored in said first consumer record for at least one of an assumed non-transaction demographic, a dis-assumed non-transaction demographic, a mixed derived demographic, a most probable assumed non-transaction demographic, a most probable dis-assumed non-transaction demographic, a probability of assumed non-transaction demographic, a probability of dis-assumed non-transaction demographic, a ranking of assumed non-transaction demographic, and a ranking of dis-assumed non-transaction demographic associated with that consumer.

In another aspect, the invention provides a system and method for defining or selecting terms of at least one purchase incentive offer to associate with a Consumer Identification (“CID”) associated with a transaction history for a consumer, comprising: a database; a computer system having read and write access to said database; and wherein said database stores a plurality of consumer records including a first consumer record for a first consumer, wherein said first consumer record stores: (1) CID data (consumer identification data) indicating a first consumer CID for said first consumer; in association with said first consumer CID, at least the following: (2) transaction data in a first transaction class field indicating items transacted by said first consumer in a first transaction class during a first prior time period; (3) a first assumed non-transaction demographic data field for storing assumed non-transaction demographic data; and means or code for defining or selecting terms of a purchase incentive offer to associate with said CID wherein said defining or selecting depends at least in part upon one of an assumed non-transaction demographic, dis-assumed non-transaction demographic, a most probable assumed non-transaction demographic, a most probable dis-assumed non-transaction demographic, a probability of an assumed non-transaction demographic, a probability of dis-assumed non-transaction demographic, a ranking of an assumed non-transaction demographic, and a ranking of dis-assumed non-transaction demographic associated with that consumer. Additional aspects of this invention include defining or selecting depends at least in part upon assumed non-transaction demographic data.

In another aspect, the invention provides a system and method for defining a probability value that a demographic data value either is associated with a consumer or is not associated with the consumer, comprising: a database; a computer system having read and write access to said database; and wherein said database stores a plurality of consumer records; wherein each one of said plurality of consumer record stores: (1) CID data (consumer identification data) indicating a CID, (2) transaction data in a first transaction class field indicating items transacted during a first prior time period, and (3) a first actual demographic data field for storing actual, non-assumed, demographic data for a first demographic data variable; means or code for determining correlations between transaction data and data stored in said actual demographic data field from said plurality of consumer records and using at least one of said correlations to define a probability function, wherein said probability function is a function at least of transaction data, and said probability function applied to a consumer record generates a probability value that the consumer associated with said consumer record has a specified value for said first demographic variable; means or code for storing said probability value in a data field for said specified value for said first demographic value and said consumer record. Additional aspects of this invention include wherein said means or code for determining correlations uses data from at least two transaction data fields in said plurality of consumer records; wherein said means or code for determining correlations uses data in at least three transaction data fields in said plurality of consumer records; wherein said probability value is one or zero; means or code for determining, by applying a function to demographics associated with said consumer and prior product incentive redemption history for that consumer, which of a set of incentive offers said consumer is most likely to redeem; and means or code for determining, from probability of each one of a set of assumed non-transaction demographics associated with said consumer, a set of incentive offers for purchase of specified products available to be offered to said consumer, and probability of said consumer redeeming each one of said set of incentive offers, which incentive offer, if actually offered to said consumer, is likely to result in the greatest profit.

In another aspect, the invention provides a system and method for estimating a fraction of people having a specified demographic value that both shop in a specified retail store and live in a specified block region comprising: a transaction data database; a block data database; a computer system having read and write access to said transaction data database and said block data database; and wherein said transaction data database stores a plurality of consumer records each including at least a CID data (consumer identification data) indicating a consumer CID and a first assumed non-transaction demographic data field for storing assumed non-transaction demographic data for customers of said retail store; wherein said block data database stores at least one block data record for a geographic region near the location of the retail store, each block data record storing, for its corresponding geographic region, a number of people or consumers residing in that region, and a number of people or consumers having said specified demographic value; and means or code for using data stored in said both said transaction data database and said block data database to estimate said fraction of people having said specified demographic value that both shop in said specified retail store and live in said specified block region. Additional aspects of this invention include means or code for generating a decision whether to target market to consumers associated with said specified demographic value that live in said geographic region; means or code for determining a number of consumers associated with said specified demographic value that live in said geographic region; means or code for determining whether to target market to consumers in said geographic region based upon either said number of consumers associated with said specified demographic value that live in said geographic region or an estimate of a ratio of number of consumers associated with said specified demographic value that live in said geographic region to total number of consumers that live in said geographic region; means or code for determining an estimate of total value of goods purchased from said retail store by consumers associated with said specified demographic value that live in said geographic region; and means or code for determining whether to target market to consumers residing in said geographic region based upon an estimate of either said total value of goods purchased from said retail store by consumers associated with said specified demographic value that live in said geographic region to total value of goods purchased in said block region from consumers having said specified demographic value.

In another aspect, the invention provides a system and method for determining whether to target market to consumers residing in a block regions, comprising: a transaction data database; a block data database; a computer system having read and write access to said transaction data database and said block data database; wherein said transaction data database stores a plurality of consumer records each including at least a CID data (consumer identification data) indicating a consumer CID for a customer of said retail store; wherein said block data database stores at least one block data record for a geographic region near the location of the retail store, each block data record storing, for its corresponding geographic region, a number of people or consumers residing in that region, a number of people or consumers residing in said region that have a specified value or range of values for a specified transaction history variable; means or code for estimating either a fraction or an absolute number of consumers residing in said block region having transaction data having either said specified value or said range of values for said specified transaction history variable to define an estimate of said fraction or said absolute number; and means or code for determining whether to target market to either said block region or to at least one consumer residing in said block region based upon at least said estimate of said fraction or said absolute number. Additional aspects of this invention include means or code for estimating estimates absolute number of consumers residing in said block region having transaction data having either said specified value or said range of values for said specified transaction history variable; means or code for estimating estimates a fraction of consumers residing in said block region having transaction data having either said specified value or said range of values for said specified transaction history variable; wherein said specified transaction history variable is a quantity of spending in a prior time period; wherein said specified transaction history variable is quantity of purchase of a specified product in a prior time period; wherein said specified transaction history variable is quantity purchase in a specified class of products in a prior time period; wherein said specified transaction history variable is a measure of redemption of transaction incentives in a prior time period; wherein said specified transaction history variable is a measure of redemption of transaction incentive in a specified class of transaction incentives in a prior time period; wherein said means or code for determining whether to target market to either said block region or to at least one customer residing in said block region also depends upon at least demographic data variable; wherein said means or code for determining wether to target market to either said block region or to at least one customer residing in said block region also depends upon at least one derived data variable; and wherein said means or code for determining wether to target market to either said block region or to at least one customer residing in said block region also depends upon at least one mixed data variable.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention is better illustrated in connection with the following figures.

FIG. 1 is a schematic of a computer network system;

FIG. 2 is high level flow chart;

FIG. 3A is an intermediate level flow chart of step 210 in FIG. 2;

FIG. 3B is an intermediate level flow chart of step 210 in FIG. 2;

FIG. 3C is an intermediate level flow chart of step 210 in FIG. 2;

FIG. 4 is an intermediate level flow chart of step 220 in FIG. 2;

FIG. 5 is another intermediate level flow chart of step 220 in FIG. 2;

FIG. 6 is a graph relating to probability outputs of the model;

FIG. 7 is a table illustrating a consumer record;

FIG. 8 is a table illustrating a consumer record with purchases accumulated by month;

FIG. 9 is a table illustrating cells containing fractions representing the average for that cell in a set of records;

FIG. 10 is a table illustrating a cells containing fractions representing the averages for that cell in a set of records wherein all records have a certain value in the cell in row 5/00 and column P12; and

FIG. 11 is a table illustrating the difference between tables 9 and 10, which represents correlation data of each cell to the value of the cell in row 5/00 and column P12.

DETAILED DESCRIPTION OF THE EMBODIMENTS

FIG. 1 shows a computer network system including network 10 (preferably the Internet), analysis computer system 20 including server system 20 a, database 30, manufacturer computer system 40, retailer computer system 50, consumer computer 60, databases 70, 80, 90, point of sale (POS) terminal 100, and kiosk 110. The lines connecting elements in FIG. 1 indicate a means for data transmission including wire and wireless transmission hardware, data format, and transmission protocols. Each computer system includes at least one digital computer including associated central processing unit, memory, input, and output devices.

Data stored preferably in database 30, but which may be stored in any one of databases 30, 70, and 80 includes at least one of purchase transactions data, redemption transactions data, reservations transactions data, demographics data, and loyalty quotient data in association with a consumer identification.

Preferably, the transactions data for each transaction is stored in association with a date of the corresponding transaction. Redemptions transaction data may be stored with more than one date: the date of the transaction and the date or dates of the reservation. Redemption transactions data may be stored in association with more than one date including the date of the redemption transaction and the data upon which the incentive transaction resulting in the redemption was offered. Demographics data and loyalty quotient data may also be stored in association with at least one date, such as the date upon which the demographics data was received or the date on which the loyalty quotient was calculated. Demographics data and loyalty quotient data may also be stored in association with another date or date range representing the date range over with data resulting in the demographic data or loyalty quotient data was received.

All data received in step 20 relating to a particular consumer identification (CID) is stored in the database in association with that CID. The database may be formatted as flat files or as one or more relational database files and may include tables, forms, queries, relations, reports, modules, and other objects useful in database management and programming. For convenience, the data analysis will be described below in the context of consumer records in which each record includes a field for CID (or CID in combination with store ID) and a large number of associated fields. However, it will be apparent to those skilled in database programming that the relations between the data may be otherwise than as specifically described, for example, by having data stored in third normal form.

The exemplary data format comprises a table with transaction records in which each record includes fields for CID, store ID, dates of purchase, identification of products, services, and reservations purchased in association with the date of purchase, identification of transaction incentives offered including terms of the offering in association with the date of offering and dates of redemption of each transaction incentive. Each record may also include time of day of transaction, day of week of transaction, day of month of transaction, form of payment (such as credit card, cash, check). Each transaction record may include a currency amount of each product item, transaction total currency amount, currency amount of redemptions, incentive amount of offered transaction incentives, incentive amount associated with product item, frequent shopper ID, amount of purchase less discounts and incentives, manufacturer identifications associated with products for which each incentive was offered or redeemed. Each transaction record may use a universal product code (UPC) as the identifier of a product, service, reservation.

Preferably, there is one record per customer either per transaction, per transaction date, or per date range.

The database may also store in association with a CID demographic data including age, gender, income, household income, location of residence, location of work, postal code, family size, number of children, marital status, gender, type of commuting vehicle, length of commute, type of job.

The database may also store in association with a CID block data identification. The database may also store in association with the CID or the block data identification block data, which is data indicating statistical averages for transactions and other data for consumers that live in a specified block group.

The database may also store, in association with a CID, fields derived from data, such as fields derived from demographic data for the customer, block data for the block to which the customer belongs, and loyalty quotient data for the customer's loyalty to a store, brand, or any class of transactions. These derived fields may contain data representing likelihood of a consumer purchasing in a specified class, at a specified store, or in a store of a specified retail chain.

Derived fields also include fields for frequency of specified classes of transactions, including purchases, redemption, and reservations, such as for specified products, over a specified time period.

Preferably, analysis system 20 is either hard wired or programmed to perform the algorithms disclosed herein for predictive modeling, and the applications of models to rank, determine probabilities, or make predictions. However, retailer computer system 50 or manufacturer computer system 40 may be programmed to perform those functions.

FIG. 2 shows step 200 of storing data, step 210 of predictive modeling, step 220 of determining incentive data and step 230 of offering incentives.

In step 200 a database, such as database 30, receives and stores consumer related data. Alternatively, any one of databases 70 and 80 may receive and stores consumer related data for performing the following steps.

In step 210, system 20 runs software or implements in hardware a form of predictive modeling to analyze a set of data for multiple CID records in database 30. The predictive modeling identifies correlations between the existence of transactions in one or more classes of transactions in one or more time period and the existence of transactions in one or more transaction classes (hereinafter the correlated class) in at least one subsequent time period (hereinafter the correlated time period). The predictive modeling may also include other correlations, such as correlations based upon a consumer's demographic, block data, and loyalty quotients.

Generally, speaking, the predictive modeling applies the statistical correlations to any consumer's transaction data to predict what transaction behavior that consumer is likely to enter into in the correlated time period. The model may generate a probability, a prediction, and it may rank the relative likelihoods of each member of a set of consumers entering into a class of transactions.

For example, in step 210, system 20 may determine the probability that a CID (that is, people or households associated with the CID) that had no purchases of coffee associated with it will purchase coffee in the next month. In step 210, system 20 may rank CIDs not previously associated with purchase of coffee according to their probability of purchasing coffee in the next month. As another example, system 20 may rank households that had previously purchased coffee by their probability of not purchasing coffee in the next month (lapsing). As another example, in step 210, system 20 may determine the probability of a CID not purchasing at a retail store in the next month.

In step 220, system 20 determines incentive data for CIDs, in which the determination is based at least in part upon the rankings generated in step 210.

In step 230, system 20 takes action that will ultimately provide the transaction incentive to the consumer. This includes saving the data in a file for displaying or printing in human readable form, transmitting the data to a POS computer system, an Internet web site, an email address associated with the CID, or a postal mailing facility. Data defining a transaction incentive for a consumer may be provided to the consumer via email, at Kiosk 110 upon the consumer being identified by the Kiosk, at a POS terminal when the consumer's CID is associated with a transaction at that terminal, or postal mailed to the consumer's postal address associated with the CID.

FIG. 3A shows steps relating to step 210 that system 20 may implement in determining a probability of a particular consumer changing a particular class of transactions behavior in a specified class. The specified class and the correlated time period are represented by a specified data field.

In step 310 a, system 20 receives input filter criteria. Input filter criteria are criteria that are values for or depend upon statistical correlations of input fields for data associated with consumers to the data for the specified data field. The input fields are fields that store data for transactions occurring in one or more time periods prior to the correlated time period.

Examples of filter criteria depend upon the specific predictive model. However, an example of filter criteria is criteria specifying the 1, 3, 5, or 10 fields most highly correlated to the specified data field. Another example of filter criteria is all fields having a value for correlation to the specified data field of greater than one of 0.6, 0.7, 0.8, or 0.9. Another example of filter criteria is all fields having a correlation to the specified data field of greater than 0.6 and which are fields associated with transactions from a different class than the specified class.

In step 320 a, system 20 applies the input filter criteria to a set of records in order to determine which fields meet the criteria. The data fields that meet the criteria are the data fields used in subsequent steps involving the transaction data. System 20 applies the input filter function using the input filter criteria to a set of data records from database 30 to define a subset of data fields in the records. Preferably, the set of records is large enough so that the results of the predictive modeling are statistically significant. This means that at least about 50 records, preferably at least 100 records, and typically 500 to 20,000 records are included in each predictive modeling.

The input correlation filter criteria preferably includes at least one cross-correlation value between a field other than the specified data field and the specified data field. A cross-correlation means a value for correlation between the specified data field and other data fields in the CID record, for a set of CID records. Number of cross-correlations means a number, examples being 2, 4, 10, 20, 50, and 100 of cross-correlations defined by different sets of two data fields, one of which is preferably the specified data field. The input filter function ranks the cross-correlations to the specified data field of other data fields, and limits the input fields for step 330 a to those data fields most highly statistically correlated to the specified data field.

The input filter function is applied only because current digital computer systems are in practice not currently powerful enough to implement the remaining steps of FIG. 2, given a very large number of input fields, such as the several thousand product specific transaction classes defined by individual UPC codes. The filtering steps are therefore optional and may not be practiced when sufficiently powerful digital computers become available. Moreover, instead of automated filtering, a user may select which fields to use for correlation in the following step.

The specified data field is preferably a data field whose data indicates quantity of transactions in the specified class in the correlated time period. However, the specified data field may also be a field whose data indicates a ratio of quantity of transactions in the specified class in the later time period (correlated time period) to quantity of transactions in the specified class in one of the earlier time periods. Quantity in this sense may be measured by number of items, volume of items, cost paid for items, or any other means to measure quantity.

In step 330 a, system 20 applies a predictive modeling algorithm to the input data fields defined in the filtering step. The predictive modeling algorithm is designed to result in a function that can be applied to a consumer record to generate a value. The value obtained by applying the predictive modeling algorithm to a record for a consumer is a rank, probability, or prediction that the actions of the consumer in the correlated time period will result in data in the specified field. For example, the predictive model algorithm may assume a predetermined functional form, then determine cross correlations of the input data fields to the correlated data field, and then set coefficients in the predetermined functional form to certain values based upon the values of the correlations that were obtained between the input data fields and the specified data field.

The covariance, C, of two variables, A and B, is defined as the expected value of their product E{AB} minus the product of their expected values E{A}E{B}. C=E{AB}−E{A}E{B}. The correlation coefficient r of the variables A and B is by definition: r=C/sigma(A)sigma(B). Two variables A and B are uncorrelated if their covariance is zero. The correlation coefficient or the covariance may be used as the measure of the correlation between the target product or class and the input data fields.

The predictive model function is a function of the values of a consumer's transactions in specified classes corresponding to the input data fields. Those classes are the classes which were the inputs to the predictive modeling used to derive the correlations used in defining the predictive model function for the specified class.

Preferably, the predictive model algorithm's predetermined functional form also includes as input variables certain demographic variables, loyalty quotient variables, and block data variables. In this alternative, the resulting predictive model function also depends upon the data values in a consumer record for the corresponding certain demographic fields, loyalty quotient fields, and block data fields. In this alternative, the value of the predictive model function reflects the likelihood of the consumer performing the specified transaction in the correlated time period, including the impact on that likelihood due to demographic data, loyalty quotient date, and block data in the consumer's record.

In step 340 a, system 20 applies the predictive model function defined in step 330 a to a set of consumer data records. The predictive model function generates data stored in association with each CID. The generated data is preferably a ranking probability value, or prediction, Ps (predicting data), for the consumer transaction in class S for that CID in the correlated time period. The predicting data may be stored to indicate probability or prediction of purchase in the specified claims by the consumer having the specified CID in the correlated future time period. The correlated time period is typically the next week, next month, next six weeks, or next three month period compared to the most recent time period associated with data in the input data fields. However, the correlated time period may be a discontinuous time period from the time period representing the transaction dates for data in the database, such the following week, month, or quarter. The predicting data may store data indicating a probability of reduction or increase in quantity of transaction in the specified class in the correlated time period, such as the next week, month, or quarter.

Typically, one set of data records is used in steps 310 a to 330 a in order to define the predictive model function. Then the predictive model function is applied to a different set of records in step 340 a to generate rankings, probabilities, and predictions for a correlated time period corresponding to a future time period for use in making decisions regarding transaction incentives to offer to the corresponding consumers.

Alternatively, the predictive model function may also be applied to the different set of consumer records in step 340 a to generate rankings, probabilities, and predictions for a correlated time period, but wherein the correlated time period is for a historical time period in which there is actual transaction data in the set of records. In this alternative, the rankings, probabilities, or predictions for the correlated time period may be compared to the actual transaction data for the correlated time period to determine how closely the predictions match actual data. This comparison may be used to generate a predictive model function validation value for the specific predictive model function.

In step 350 a, system 20 may automatically compare the predictive model function validation value to a predetermined value (such as the results of prior predictive model function validation values for models predicting the same specified data field) to determine whether to return to step 310 a. If the system 30 returns to step 310 a, it automatically obtains a different set of input filter criteria based upon predetermined programmed parameters and then repeats steps 320 a to 350 a. Alternatively, the decisional step 350 a may be user controlled, in which case system 20 waits for user instructions once completing step 340 a.

When decisional step 350 a does not loop back to step 310 a, processing proceeds to step 220 of FIG. 2. Typically, processing will only proceed to 220 when the correlated time period is a future time period.

Steps 340 a and 350 a enable feedback on the effect of the form or the coefficients of the predictive model function. Thus, system 20 may store in code alternative functional forms which in addition to filter criteria may change between iterations of cycle 310 a, 320 a, 330 a, 340 a, and 350 a. System 20 may store a plurality of validation values in association with their predictive model functions for the specified data field, and may store validations values in association with their predictive model functions for a plurality of specified data fields. Preferably, prior to proceeding to step 220, system 20 has determined the predictive model function providing the largest validation value for each specified data field and calculated and stored the corresponding ranking, probability or prediction data for each specified data field.

Preferably, a predictive model is generated using a relatively small subset of all CID records stored in a database, for computational efficiency. However, the predictive model function or functions derived from the algorithm of FIG. 3 a may subsequently be applied to any subset or all CID records stored in database 30. Preferably, the predictive values, Ps, are determined for a large number of CID records in database 30 and of course each Ps is stored in a field associated with the corresponding CID.

FIGS. 3B and 3C also show slightly different algorithms corresponding to step 210.

In step 310 b, system 20 (or a user) selects a target product or product class. The specified product or class is one for which a prediction will be generated as to the probability a consumer will change purchase behavior. For example, the target could be a product class, such as breakfast foods; a product type, such as cereal; a product brand, such as Kellogg's cereal; a specific product, such as Kellogg's Cons Flakes; or a specific product size, such as a 16 ounce box of Kellogg's Corn Flakes.

As in FIG. 3A, the transactions data could be any measure of the consumer's purchase behavior, such as frequency of purchase, dollar value of purchase, item count of purchase, frequency of redemption of incentives, value of redeemable incentive, relative amount of incentives redeemed in different classes, price elasticity, or a measure of responsiveness to advertisements or marketing such as measure of redemption of coupons, use of vouchers, or request for rebates.

In step 320 b, system 20 determines whether a correlation exists between purchase data for a target product in a correlated later time period and the other data in fields in a set of consumer transaction records in database 30 corresponding to transactions in an earlier time period.

System 20 may determine a correlation coefficient for a variable, A, that is a measure of behavior in a later time period for a specified or target product with purchase in an earlier time period of a non-specified product (or other recorded transaction activity).

System 20 may determine correlations of A with a set of variables Bi for i=1 to n where each Bi is a measure of the number of a non-target product items or volume of purchase of a non-target product item in the earlier time period.

In step 330 b, system 20 selects a subset of variables, B, providing relatively high correlations with the targeted product based on the results of step 320 b. For example, assume system 20 is predicting which households will be purchasing cereal during the following month. System 20 identified a correlation between the target product of cereal and five variables: B1, B2, B3, B4, and B5, which are measures of monthly purchases of more than three gallons of milk, monthly purchases of at least three loaves of bread, the presence of two or more children under the age of 15 in the household, a monthly grocery bill exceeding 350 dollars, and the absence of purchase of Kellogg's Pop Tarts during a one-month period, respectively. In step 330 b, system 20 selects this subset of 5 variables to include in a predictive model. In step 330 b, system 20 defines a predictive model based upon the number of variables selected and the correlations of each one of those variables with the target product. For example, the predictive model may set the probability Ps that a consumer will purchase the target product in a future time period to a normalized set of values for the sum a1*B1+a2*B2+a3*B3+a4*B4+a5*B5 in which the weighting coefficients a1 to a5 in the linear combination are proportional to the corresponding correlation coefficients between the B data fields and the data for the specified product in the correlated time period. Given this predictive model function, for example, if the target is the class of cereal, system 20 may predict a high probability of reduced purchase of cereal, or a low probability of purchase of cereal, for a CID record which indicates purchases of more than three gallons of milk in the last month, purchases of more than three loaves of bread in the last month, existence of three children in the household, household grocery expenditures of at least $350 in the last month, and information that the household did not purchase Kellogg's Pop Tarts during the one-month period. Depending upon the correlations and the resulting probability function, the high probability of reduced purchase of cereal, or a low probability of purchase of cereal could both be inverted.

In step 340 b, system 20 applies the probability function to the subset of variables included in step 330 b in a probability function to obtain a value that will indicate either the probability that a consumer will purchase the target product in a specified future time period or the probability that the consumer will change purchase behavior for the target product.

In step 310 c, the target product or purchasing behavior that the model is to predict is entered into system 20. That is, either a data field, set of data fields, or function of one or a set of data fields in CID records is specified to system 20.

In step 320 c, system 20 examines a certain set of CID records in database 30.

In step 330 c, system 20 checks each record in the set to ascertain which consumer records show the presence of the predicted behavior. If the predicted behavior is present in the customer record, system 20 performs step 340 c. If the consumer record does not meet the predicted behavior, system 20 performs step 360 c.

In step 340 c, if the consumer record meets the predicted behavior, system 20 adds the consumer record to a target matching database table.

In step 350 c, system 20 checks to see if there are additional consumer records to access. If there are additional consumer records to access, system 20 performs step 320 c. If there are no more consumer records to access, system 20 performs step 365 c.

In step 360 c, the consumer record does not meet the predicted behavior and system 20 adds the consumer record to a non-target matching table in database 30.

In step 365 c, system 20 determines correlations between the targeted purchasing behavior and the non-targeted data fields. The correlations are discernable by comparing the mean value for each non-targeted product in the target matching database to the mean value for that non-targeted product in the non-target matching database. A significant difference in the mean values indicates that a correlation exists between non-targeted data field and the targeted purchasing behavior. The larger the difference in mean values, the larger the correlation. System 20 can define a predictive model as discussed for FIGS. 3A and 3B based upon the values of the correlations between the non-targeted products and targeted purchasing behavior.

In step 370 c, system 20 reads a customer record from database 30 and applies the predictive model to determine probability of the targeted purchasing behavior occurring.

In one embodiment, the generation and storing of ranking, probability, or prediction criteria is bypassed. In this embodiment, incentive decisions are made by determining whether certain data exists in the customer's transaction record, wherein the certain data are in fields found to be highly correlated to the specified class or specified transaction data. In this embodiment, a decision whether to offer a transaction incentive for a transaction in the correlated time period in the specified class depends upon the existence of the predetermined pattern of transactions found to be correlated to the specified data field. This pattern matching based decisional process has the advantage of avoiding the complexity of storing intermediate data, namely the ranking, probability, or prediction data for each customer.

FIG. 4 shows steps relating to one example of step 220, determining transaction incentives from predictions obtained from a predictive model function. In step 410, system 20 determines whether the probability that a consumer (i.e., a consumer or household associated with a CID) who has purchased W will cease purchasing W (indicated by the subscript “−W”) and decides based upon that probability whether to offer an incentive to that consumer for purchase of W. As shown, if the probability of the consumer changing his or her purchase behavior to stop purchasing W is greater than or equal to 0.8, system 20 decides to implement step 420 and provide an incentive on purchase of W to the consumer. The value of the incentive may depend upon the value of the probability. If the probability is less than 0.8, system 20 implements step 430 and decides not to offer the consumer an incentive for purchase of W.

FIG. 5 also shows steps relating to step 220. In step 510, system 20 determines the probability that a consumer a consumer or household associated with a CID) who has purchased W will cease purchasing W, and decides based upon that probability whether to offer an incentive to that consumer for purchase on a product correlated to W. As shown, if the probability of the consumer changing his or her purchase behavior to stop purchasing W is greater than or equal to 0.8, system 20 decides to implement step 520 and provide an incentive for purchase on a product correlated to W to the consumer. The value of the incentive may depend upon the value of the probability. If the probability is less than 0.8, system 20 implements step 530 and decides not to offer the consumer an incentive for the purchase of W.

FIG. 6 shows CID records ordered by probability, from highest to lowest probability extending along the X axis. FIG. 6 shows the percent of CIDs (from 0%, 10%, 50%, 100%) on the X axis having the probability shown on the Y axis. Here, probability means the probability for a CID having data in the specified data field.

In optional step 350 a, filter criteria are changed, and the process of steps 310 a-340 a are repeated based upon the new criteria. The change may be programmatically implemented based upon programmed criteria or manually implemented by commands entered by a user via an input/output device.

FIG. 6 shows a ranking of consumers by probability of consumers changing from previously purchasing product W to not purchasing product W (indicated by “−W”). As shown, 10% of CIDs have a probability of greater than 0.7. These consumers may be ideal candidates for an incentive. The consumers whose CIDs are represented on the X axis by the 50% to 100% range are shown having probabilities of changing purchase behavior on W of less than about 0.4. This means that one half of all consumers having CID records in the system are determined to have less than about 0.4 probability of ceasing their purchases of W.

FIGS. 4-6 relate to predicting the probability of a consumer changing behavior from purchasing a product in one time period to not purchasing that product in a subsequent time period. In those cases, one marketing possibility is to target market consumers having a high such probability. Alternatively, the system and method disclosed herein may be used to determine consumers that have not previously purchased W who will start purchasing W. Decisions on incentives to those consumers for W and for products correlated to purchase of W may be made depending upon this probability.

System 20 could decide whether to issue an incentive offer based solely on a score generated by the model for a target product. The system could also decide to issue different incentive offers based on score. Examples of different incentive offers include free target products; discounts on the target product, offers on a different brand in the same class as the target product, offers on a product complimentary to the target product; and discounts on purchases of multiple packages the target product. Complementary products are those products often used on conjunction with one another, such as milk and cereal, or cheese and crackers, or reservations for flights and reservations for hotels.

In addition, an incentive program could be part of a marketing campaign or as a method of generating further information on the purchasing habits of consumers to help improve accuracy of system 20's predictive modeling.

In addition to recognizing the presence or absence of a product, system 20 may be programmed to account for the consumer's changes in purchase level for specific products. For example, a consumer purchased eight gallons of milk in February/March but only four gallons in April/May. System 20 could use milk purchase data for both prior time periods to account for tendencies that are common to the population as a whole. For example, people may not drink as much milk when it is hot outside. System 20 could use milk purchase data to account for tendencies that are specific to the consumer. For example, system 20 may determine from records based on at least one of the consumer's airline tickets, hotel records, and grocery purchase records from a location distant from the customer's normal place of purchase that the consumer had been out of town for a certain time period.

Detailed examples of algorithms for performing the invention follow in order to explain aspects of the invention and identify types of transaction incentives applicable to the results of the predictive modeling.

First, obtain a consumer transaction database containing transaction data for a first set of consumers. This transaction database may also include demographic data. In this example, the database contains transactions for classes P1, . . . P12. P1, . . . P12 may each represent the products having the same Universal Product Code (UPC), or any class of products sharing a common attribute as previously discussed.

Second, cumulate the transactions records for each consumer by time period, such as by month. That is, identify the number of item or services purchased in association with a consumer identification within each one of classes P1, . . . P12, in each month. Tables 1 and 2 illustrate this step.

In FIG. 7, table 1 shows a customer's transactions in association with dates of transaction for transaction classes P1 to P12. Each row in table 1 represents a transaction and specifies the date of the transaction and the quantity of items transacted in each class.

In FIG. 8, table 2 shows the cumulation of quantity of items transacted in classes P1 to P12 by time period. More specifically, table 2 shows the cumulation of transaction data for customer 1 in each one of months of January through May of the year 2000 (1/00 to 5/00). Alternatively, place a 1 in the cells in table 2 for any month and Pi in which the consumer has purchased at least one item.

Second, for each class Pi and each one of the time periods, calculate the fraction of the consumer records of the first set of consumers having a non-zero value.

In FIG. 9, table 3 graphically illustrates the result of the second step.

Third, identify a first subset of cumulated transaction history records for consumers of the first set that purchased at least 1 item in class P12 in time period 5/00. That is, filter from the larger set of records a sub set in which the fraction for P12 is 1. That is, include in the first subset only customer transaction records that have a non-zero value for transactions of class P12 in time period 5/00.

Fourth, repeat the second step on the first subset. That is, for each class Pi and each one of the time periods in the first subset, calculate the fraction of consumer records having a non-zero value.

In FIG. 10, table 4 graphically illustrates the result of the fourth step.

Fifth, subtract the values in cells in Table 3 from the values in the corresponding cell in Table 4.

In FIG. 11, table 5 illustrates the result of the fifth step. Table 5 shows values in each cell that are representative of correlations to the existence of transactions in P12 in time period 5/00. A positive value in table 5 indicates purchase in the corresponding class correlates to purchase of P12 in time 5/00, with the magnitude of the value indicating the degree of correlation. A negative value in Table 5 indicates failure to purchase in the corresponding class predicts purchase of P12 in time 5/00, with magnitude of the value indicating the degree of correlation.

Alternatively, instead of correlating to a fraction representing the existence or lack of existence of a transaction in each cell in table 5, steps 1 to 4 could have been modified so that table 5 correlated to the average number of items purchased. That is, tables 3 and 4 could have been determined by calculating the average number of items transacted instead of the fraction of the consumer records having non-zero values. In that case, the cells in table 5 would contain numbers indicating an average difference in number of items purchased for those in the first subset compared to the average number of items purchased for those in the first set. Use of data in this alternative would require subtracting the average values determined for the first set of consumers from the actual transaction data for a specific consumer, and then determining whether the result of that subtraction correlated to the alternative table 5 data. This additional step is not necessary when using the correlation data shown in table 5, as explained below.

The correlation data shown in table 5 may be used in many ways to estimate the likelihood or relative likelihood of a consumer transacting in class P12 in the future. Preferably, the correlation data is used by operating with it on an individual consumer transaction record to determine a likelihood of that consumer subsequently purchasing in P12 in the correlated time period, or a relative likelihood relative to other consumers, of that consumer subsequently purchasing P12 in the correlated time period.

A consumer having non-zero transactions in a class and a time period that is correlated to a future transaction in class P12 is a predictor that the consumer will subsequently conduct a transaction in P12. That prediction may be used as one factor in deciding whether and when to provide to that particular consumer a transaction incentive offer to purchase in class P12 in the correlated time period. A consumer having no transactions in any class and a time period that is correlated to a future transaction in class P12 is a predictor that the consumer will not subsequently conduct a transaction in P12 in the correlated time period. That prediction may be one factor used in deciding whether and when to offer that particular consumer an incentive to purchase in class P12. Moreover, correlations or lack thereof may be used to determine the value of any incentive transaction to offer to the consumer.

The identification of correlations and their magnitude indicated by non-zero values in table 5 may be used to combine the significance of some or all of the correlations. One method involves determine the difference in the fraction of consumers that purchase in two or more of the correlated classes identified in table 5 and subsequently purchase in class P12 in time period 5/00 (the correlated time period) to the fraction of consumers that purchase those two correlated products without regard to purchase of P12 in time period 5/00. Similarly, fractional difference correlations may easily be calculated for any combination of cells of table 5 showing a correlation. More than one such correlation value for two or more of the cells shown in table 5 may be calculated. The resulting correlations may be used individually or combined in a ranking function as discussed in the next paragraph.

Alternatively, we can derive a ranking function by defining a multi variable correlation formula applicable to the data for a consumer's transactions that cumulates the significance of each of the single class correlations shown in table 5. Applying the derived multi variable correlation formula to a second set of consumer transaction records results in values that rank each consumer record by the likelihood of the corresponding consumer purchasing in P12 in the correlated time period. An expression for an exemplary ranking function follows. Define a ranking function R(P12, n, Sk)=R, which defines a rank R for consumer record Sk's likelihood of transacting in class P12 in time period n. We define R(P12, n, Sk)=S(NCij if Cij is positive −Cij if Cij is negative), where Nij represents the number of items transacted in class i in time period j, S represents summation over values of i and j, Cij are the correlation, values derived as indicated for table 5, and j runs from 1 to 12, or 1 to 11. In the example represented by table 5, non-zero values of Cij only exist for certain i values wherein j=n−1 (time period 4/00) or j=n−2 (time period 3/00). However, we recognize that correlations between more than three time periods, and non-consecutive time periods may exist, and the general form for R(P12, n, Sk) accounts for these possibilities.

Any of the multi variable correlation functions applied to a set of consumer transaction records will result in a distribution of values for consumer records Sk. The records of some consumers will have relatively high R values and the records of other consumers will have relatively low R values. Relatively high values indicate a relative likelihood that the corresponding consumers will purchase in category P12 in the predicted time period. The system of the invention may use both relatively high values and anticipated time period of purchase and relatively low values as inputs in determining whether to make available to a consumer an incentive offer and when to make the incentive offer available.

Statistical analysis, heuristic, or ad hoc rules may be applied to the R function to map R values to probabilities in the range 0 to 1. The simplest method of mapping the R values for a set of consumers to a probability range is to normalize the R values by dividing all of the R values for a set of consumers by the largest R value in the set. The results of the ranking or probability of purchase of P12 in time period 5/00, the correlated time period, may be used in decisions regarding offering of incentives in a variety of ways.

In the case of a relatively high likelihood of a consumer purchasing in class P12 in a correlated time period, one of the manufacturers competing for sales in class P12 may decide to offer the consumer a discount on their brand product in class P12, and at a time, such as a day a few days, or a week prior to the anticipated purchase time period, or during the anticipated purchase time period. If the consumer's prior purchase data shows that the consumer preferentially purchases that manufacturer's products in that class P12, the manufacturer may decide to offer no incentive or to offer only a low value incentive. If the consumer's prior purchase data shows that the consumer preferentially purchases a different manufacturer's product in class P12, the manufacturer may decide to offer a large value incentive in an attempt to induce the consumer to try its product. If the consumer's prior purchase data shows that the consumer has purchased from a variety of different manufacturer's product in class P12, the manufacturer may decide to offer an intermediate value purchase incentive in an attempt to induce the consumer to try its product. The determinations based upon consumer's preference to a different manufacturer in class P12 is an example of a class and subclass determination. The class is P12. The subclass is the brand of the manufacturer of products in P12 to which the customer's prior sales are associated.

One benefit of this invention is that it enables manufacturers and retailers to make determinations regarding incentive determinations based upon a ranking or likelihood of a consumer purchase in a class and additional data regarding the consumer's preferences or purchase history in that class or a subclass of that class. Thus, the invention provides for making decisions regarding purchase incentive offers based upon a customer's prior purchase history in (1) a class and (2) a subclass of that class and correlations to either the class or the subclass indicating likelihood of the consumer purchasing in either the class or the subclass in a correlation time period.

In the case of a relatively low likelihood of a consumer purchasing in class P12 in a correlated time period, a manufacturer may decide either to offer the consumer a relatively high value purchase incentive for purchasing in P12 or to forego offering the consumer a purchase incentive in P12.

The foregoing references to manufacturers in this example apply equally to retailers. Especially when those retailers (1) desire to increase sales by offering purchase incentives on products a consumer is unlikely to otherwise purchase, (2) introduce their own brands in their stores (house label brands) in competition with the other brands they sell in their stores, and (3) desire to motivate consumers to perform subsequent shopping transactions in stores of the same retailer rather than in stores of different retailer, such as a competing retailer.

One advantage of the invention is that it enables a determination of a correlation time period when a consumer is unlikely to purchase in a specified class, even when the consumer's purchase history shows transactions in that class in the past. This enables manufacturers and retailers to avoid the expense of generating and transmitting to consumers incentive offers that have little likelihood of being used.

The ranking and probability determinations identified above for P12 can be repeated for each one of classes P1 to P11. Manufacturers and retailers may make decisions regarding incentive offers for a consumer based upon all of that information. For example, a manufacturer may decide to offer an incentive to a consumer only for the one class in which it is most likely that the consumer will purchase in the class, and will likely purchase from another manufacturer. Alternatively, a manufacturer may decide to offer a specified number of transaction incentives to each consumer, and provide to each consumer incentives in those classes in which it is most likely that the consumer will purchase in the correlated time period, and depend the value of the incentive to each consumer upon the consumer's likelihood of purchasing from the manufacturer as determined by the consumer's prior purchase history. Alternatively, the manufacturer may decide to provide incentives only to consumers that previously have consistently purchased from that manufacturer in a class but appear unlikely to purchase in that class in the correlated time period.

Demographic data provides additional information which can be correlated to consumer's purchases in specified classes. For example, consumer demographic data may show that consumers having at least one child under the age of 12 in their household are 70 percent likely to purchase a dairy product in each shopping transaction at a supermarket, whereas consumers without at least one child in their household are only 25 percent likely to purchase at least one diary product in each shopping transaction at a supermarket. This time independent correlation data may be combined into the foregoing ranking functions to account for this known demographic effect upon a consumers anticipated transactions in the correlated time period. For example, the R function defined above may be modified by adding a fraction corresponding to the demographic based statistical likelihood of a purchase in the target class. In that alternative, the function R(P12, n, Sk)=S(NCij if Cij is positive −Cij if Cij is negative) could be modified by adding a term Dk where D represents the correlation value (0.70 for a consumer who has a child under 12 and 0.25 for a consumer who does not have a child under age 12) and k represents the specific consumer. Similarly, any other data providing a statistical correlation to certain consumer transactions may be included in a ranking or probability function indicating likelihood or relative likelihood of a specific consumer transacting in a class in a time correlated period. For example, a retailer loyalty quotient, Q, which is an estimate of the fraction of a consumer's total grocery shopping dollars spent at a certain retailer, may be included in a ranking function. For example the value of Qk for consumer k can be added to the definition of the ranking function such that R(P12, n, Sk)=S(NCij if Cij is positive −Cij if Cij is negative)+Dk+Qk.

Additional time independent consumer specific data which may be included in the ranking of consumers includes a measure of the consumer's likelihood to redeem transaction incentives that have been provided to them in various ways, such as via direct mail, email, at the point of sale, at kiosks not at the point of sale both in a retail store and out of the retail store, and transaction incentives over the Internet at web sites. The time independent data relating to the consumer's likelihood of redeeming based upon the modality of transmission of the transaction incentive to the consumer may be used in determining which modality to use in providing the transaction incentive to the consumer.

An example of a predictive model is Ps(x1, x2, x3) where Ps is a statistical probability of purchase of canned peas in a later time period, x1, x2, and x3 are measures of purchase of milk, cereal, and paper goods in a prior time period. For example, the functional form of Ps may be Ps=f1(x1)+f2(x2)+f3(x3) where 11 is a function ranging from 0 to ½, and f2 and f3 are both functions ranging from 0 to ¼. For example of the functional form of f1, f1 may be zero if x1 indicates no purchase of milk in the prior time period and ranging up to ½ with increasing quantity of milk purchased in the prior time period up to 2 gallons of milk. For example of the functional form of f2, f2 may be ¼ if x2 is zero, and x2 may decrease with increasing x2 until x2 reflects purchase of at least 5 dollars of cereal in the prior time period. For example of the functional form of f3, f3 may be zero when the value of x indicates no purchase of paper goods in the prior time period, and f3 may range up to ¼ as the value of x3 may increase polynomially to indicate purchase of $20.00 in paper goods.

Another example of a predictive model is Ps(x1, x2, x3) in which Ps is the probability of a change in purchase behavior of the specified product in the earlier and later time periods. For example, Ps may be the probability that quantity of the specified product will decrease, remain constant, or increase. Ps may or may not be a function of a measure of purchase of the specified quantity in the prior time period. As a specific example, Ps may be the probability of the quantity of purchase of canned peas in the second time period being no more than 50 percent the quantity of canned peas purchased in the first time period. Again, one possible predictive model is Ps=f1(x1)+f2(x2)+f3(x3). Obviously, the number of variables, x1, x2, and x3 may vary. Preferably, the model includes at least two variables, more preferably three or more variables. The number of variables, x, in a predictive model of the invention may range from 2 to over 100, but preferably, in view of current digital processing limitations, is in the range of 3 to 15, more preferably 5 to 10. The functional form of Ps, the predictive model, may also vary greatly. However, the functional form of Ps should reflect the underlying correlation of the variables x1, x2, etc with the change in purchase quantity in the earlier and later time periods of the specified product. The functional form of Ps may be predetermined by the user or automatically determined by computer code generating the predictive models applicable to each specified product.

This invention will allow a company with a database containing detailed shopping records for numerous consumers to help a cold cereal manufacturer protect its sales by providing an incentive to a consumer who has purchased a specific product of that manufacturer's cereal in the recent past but might stop purchasing the cereal product (hereinafter in this example referred to as “cereal”) in the future. In order to accomplish this goal, all of the consumers' purchasing records are read into a computer analysis and modeling program which uses one or a combination of regression analysis, neural network analysis, and decision tree analysis to identify characteristics in the data records that correlate with the subsequent purchase and non-purchase of cereal and to formulate a predictive model based upon the identified correlations. In formulating a prediction of a specific consumer's purchases, the modeling program relies on the predictive model generated by an analysis of statistical population of data records (CID records) applied to actual data in that specific consumer's purchasing records. The purpose of the modeling program is to examine leading indicators and determine which consumers are going to change their spending habits and in what way. For example, there may be 30 or 40 products that are highly correlated to the future purchase or non-purchase of cereal. Some of the products could be complimentary products, such as milk, bananas, or sugar. Other products might be competing products, such as oatmeal, instant breakfast drinks, muffins, or a competitors brand of cereal. Still other products might be considered which would appear to have no a prior or readily discernable rational reason for correlation to the sale of cold cereal, such as Pez candy or nail polish. The prior purchase of cereal, the target class, may or may not be one of the variables that comprises the model.

In addition to products, modeling may use a wide variety of demographic data to develop correlations based on indication of a change in lifestyle. Some demographic reasons a consumer would alter the consumer's consumption of cereal include: an only-child graduating high school and leaving the household (decreased demand compared to the same household) or the presence of a large number of children under the age of 15 in a household (increased demand for cereal compared to the average household). Note that the preceding two examples that the model can accommodate are comparisons between the household at two different times (the child going away to college) and a comparison between the household and other households generally (the household with a large number of children under the age of 15). Lifestyle factors may also be included in the analysis, such as a job change necessitating a longer commute which did not allow the consumer time to sit down and eat breakfast, the consumer may have started a fad diet that consisted only of meat (eliminating cereal from the foods consumers would eat), or the consumer's refrigerator may have broken and the consumer was unable to keep cold milk in the home. Other factors that may indicate that the consumer might stop purchasing cereal are related to the consumer's shopping habits. If the consumer constantly changed the type of cereal the consumer purchased, it might indicate the consumer did not find a type of cereal to the consumer's liking and would make the consumer more likely to stop purchasing cereal than a consumer that habitually bought a certain brand and product. Such a non-loyal consumer is a prime target for an incentive for cereal. A decline in the consumer's frequency of purchasing cereal or a decrease in the size of the cereal box purchased, or an increase in the price of cereal might forestall a future decline in the consumer's cereal purchases.

In addition to identifying products which conflate to the non-purchase of cereal, system 20's analysis formulates the relative weighting of each variable and the relevant time period or time periods that provide improved relatively larger correlations. System 20 may be programmed to evaluate purchases from two or more prior time periods and from time periods of different durations in generating a predictive model. Alternatively, system 20 may be limited to a program based on only two time periods of equal duration. For example, system 20 might predictively model consumer behavior from each one of a subsequent 6 months based upon consumer transaction information for the month of September.

As stated above, all of the variables can be derived by the computer program based on a combination of regression analysis, neural network analysis, and decision tree analysis. However, if the user chooses, any variable may be established by the user.

In a preferred embodiment, after system 20 defined the predictive model function, it applies the predictive model function to a CID record for each consumer in a set of consumers resulting in a score for each CID record for the targeted behavior. The score is preferably then compared to a value to determine if the consumer associated with the CID will receive an incentive. For example, the model for the cereal incentive program introduced above might contain the following product variables: the purchase of milk, napkins, limes, 35 millimeter film, bananas, frozen pizza, oatmeal, sugar, coffee, tissues, and canned corn; as well as the following demographic data: household income and number of children under the age of 16. Each variable is evaluated to a number. The values of the numbers may be discrete (for example, a “one” for the purchase of the product during the applicable period and a “zero” for the absence of a purchase of the product during the relevant time) or continuous. These values may be summed to produce a score for each CID. The score need not be an estimate of probability, per se. The model could equate a percentile to a score (for example, a score of seven indicates that seventy percent of the consumers will purchase cereal during the relevant period and a score of eight might indicate that eighty-five percent of the consumers will purchase cereal during the relevant period). This would enhance decision making regarding which consumers to provide an incentive to, for which product, and how enticing the incentive will be. The focus of the scoring could also be to indicate that the consumer was about to stop purchasing cereal, that the consumer was about to purchase less cereal, the consumer was about to purchase a different brand of cereal, and that the consumer was going to purchase a hot cereal instead of a cold cereal.

Alternative embodiments of the invention relating to non-transaction assumed demographic data are described below.

Data stored preferably in database 30, but which may be stored in any one of databases 30, 70, and 80 includes may include fields for actual demographics data, assumed demographics data, or both types of data. For example, that data fields may include one or more fields for household income level. For example a first field may store a first value which indicates annual household or consumer income under $50,000 per year or a second value which indicates annual household income over $50,000 per year. That first field may be for non-assumed demographic data, such as data provided by the consumer. In addition, in the same database, there may be a second field for assumed income level wherein the same first and second values indicate the same assumed income levels as in the non-assumed income level field. The demographic data may stored in various forms, such as one field for each demographic value or range, and there may be multiple demographic data values or ranges per demographic variable. For example, there may be one field storing age of consumer wherein the numeric value in that field represents the consumer's age in years. Alternatively, there may be one field for range of each age in years (0-9, 10-19, 20-29, 30-59, 60-100 for example) wherein each such field stores a boolean value, and only one field stores a boolean positive value indicating the age of the consumer. Where, in the case of assumed demographic data, age is represented by a statistical probability of the consumer being have a specified age, one field may store a value associated with a set of statistical probabilities for one or more ranges or values of the demographic variable. Alternatively, the database may contain a plurality of fields each of which stores the statistical probability of the consumer having a value of the demographic variable in a particular range or of a particular value. An example of the last alternative would be a series of fields each of which stores a value from zero to one representing the probability that the consumer is of a specified age range, such one field indicating age between 18 and 19 years, a next field indicating age between 19 and 20 years, a next field indicating age between 29 and 21 years, and so on. In this last alternative, preferably, the sum of the values stored in all of the age fields essentially equals 1.0. However, the sum of the values stored in all of the age fields may essentially equal a number other than one, so long as the same normalization is applied to all records for all consumers to which this record is to be related by subsequent processing.

Moreover, the definition of fields in the database may not distinguish between actual non-transaction demographic data and assumed non-transaction demographic data such that the data in the non transaction demographic fields associated with any CID is not indicated, for example in field definitions, to be actual or assumed data.

The database may also store in association with a CID assumed block data identification. Assumed block data identification is identification of a bock (sometimes referred to as a block group) with a CID assumed based upon correlation of assumed data in the customers' record matching data for the block group in a geographic region relatively close to the location of the retail store. Relatively close to the retail store means close enough to the retail store's location that it is likely that some substantial fraction of the people living in the block group will shop in the retail store. In addition, the database may not distinguish between assumed block data identification and actual block data identification associated with any CID.

The database may also store, in association with a CID, fields derived from non-transaction assumed demographic data including block data for the block to which the customer belongs, loyalty quotient data for the customer's loyalty to a store, brand, or any class of transactions. These assumed derived fields may contain data representing likelihood of a consumer purchasing in a specified class, at a specified store, or in a store of a specified retail chain.

The correlations resulting from consumer transaction data may be combined with correlations to consumer purchase preferences from other sources, such as correlations from actual demographic data, assumed non-transaction demographic data, and customer loyalty quotients. The correlations of a particular customer to these types of data may be incorporated into a ranking or probability function in predicting the likelihood of a consumer entering into a specified transaction in a specified future time period.

Preferably, analysis system 20 is either hard wired or programmed to perform the algorithms disclosed herein for predictive modeling, and the applications of models to rank assumed non-transaction demographic data, determine probabilities of assumed non-transaction demographic data, or make predictions based at least part upon the assumed data. However, retailer computer system 50 or manufacturer computer system 40 may be programmed to perform those functions.

In step 210, system 20 runs software or implements in hardware a form of predictive modeling to analyze a set of data for multiple CID records in database 30. The predictive modeling identifies correlations between the existence of transactions in one or more classes of transactions in one or more time periods and known demographic data. The predictive modeling may also include other correlations, such as correlations based upon a consumer's, block data and loyalty quotients to the known demographic data. The predictive modeling of course does that on a record by record basis, wherein each record is associated with one CID. The correlations identified above are then used to analyze CID data records wherein the value of the relevant demographic variable is unknown to determine either an assumed value for that variable or probabilities or expectations that the value of the demographic variable has certain values. That is, one or more correlations are used to define a function in the manner already described to generate a probability, prediction, or ranking relative to other records for a demographic variable to assume for a CID.

The assumed non-transaction demographic data may be used in a variety of ways.

Demographic data, pre se, has a substantial value. Accordingly, a set of CID records including at least one demographic field storing data based upon assumed non-transactional demographic data may be sold for value.

The assumed demographic data, if in the form of probabilities of values of demographic values, may be filtered to identify, for each CID, the demographic variable having the highest value. For example, a CID's record having a highest probability value of 0.1 for any age group and a highest value of 0.7 for income greater than $50,000.00 may be filtered to define a set of records associated with the particular demographic of income range and the value of that variable being greater than $50,000.00. Similar sets of records may be associated with each demographic variable so that each set of records for each demographic variable contains only those records from the original group of records in which the value of that particular demographic variable was the highest of any of the demographic probability values. Each such set of records associated with a particular demographic may be associated with a transaction incentive designed to target the demographic group associated with that set. For example, an incentive targeted to be effective for household incomes over $50,000.00, such as a discount on a luxury item, since an income over $50,000.00 is currently well above the average household income, may be associated with the CIDs associated demographic of household income greater than $50,000.00. The transaction incentive may then be offered to the consumer or consumers associated with the CID in any manner.

Given a set of CID records containing at least one such field, that data may be sorted by a region field, such as town, state, or postal code, and a subset of those records having only records for one or a few towns, states, or postal cods, may be offered for sale to retail stores having locations correlated with the towns, states, or postal codes of that subset of records. This is particularly useful because CID locations can be correlated with retail store geographic locations so that a retail store can use data for consumers in the geographic region around the locations of the retail store, since that is the region from which the store is likely to draw purchasers. For example, the CID records may be sorted to identify all records associated with zip code 22202, and then sold to retail stores located in or adjacent to that zip code.

Furthermore, the computer system may then use the assumed non-transaction demographic data, by comparison to demographics associated with the block regions in the local region of a particular retail store, to infer a specific consumer block region for each consumer. This may be done by correlating the values in the CIDs record, of variables including the assumed non-transaction demographic data, to demographic data of the block data for those blocks in the local geographic region surrounding the retail store from which the retail store draws customers. Then, the system assumes that the customer resides in the block region to which the CID's record is most highly correlated and stores that data in the record associated with the CID. Of course, if the consumer's actual residence location is stored in the CID's record, no such processing is needed for that consumer. The computer analysis system may determine the number of consumers assumed to be in each block region. This data may be used to identify relative numbers of a retails store's customers in each block region local to the retail store. The computer analysis system may also identify the fraction of the number of the store's customers assumed to be in each block region to the total number of consumers in that block region. The computer analysis system may also identify the fraction of the number of the store's customers assumed to be in each block region to the total dollar value spent in each block region. The computer system or the retails store's computer system may then preferentially target market based at least in part upon blocks regions where the identified fraction is relatively low, i.e., where the retail store has below average market share. For example, if the consumer is assumed to have demographic variables of age under 30, income between $20,000.00 and $30,000.00, and gender male, and correlation that age, income and gender for the 20 closest block groups surrounding the location of the retail store identifies a block group 13 as the most correlated, the computer system may assume that the consumer resides in block group 13. Alternatively, the computer system may define probability or expectation of the consumer residing in each one of the foregoing block groups, and then cumulate the probabilities for records for all consumers to assume the number of its customers in each block group region.

One way to obtain mixed derived demographic data is for the computer system to identify a set of records for consumers that do actually purchase a specified product or class of products or services, and then solicit survey information from consumers corresponding to that set of records. One way to solicit response to such a survey is to offer consumers corresponding to that set an incentive for participating in the survey. The incentive can be targeted in one of two ways. First, the incentive offered may depend upon the common product or service from which the class was specified in such a manner that the incentive is likely to be useful to purchasers of the common product or service. The computer system would in this methodology associated an incentive relating to the specified product or class of products or services used to define the set of consumers. Second, once the set of consumer's records was defined, the computer system could determine based upon criteria applied to record data, what incentive to offer each consumer whose record was contained in the set of records. Notably, the record data includes transaction data and product purchase data, and it may also include actual, assumed non-transaction, derived, or mixed derived demographic data.

The algorithmic portions of the invention are preferably implemented in software, which is also referred to as code. However, they may be implemented in hardware, for example, by burning an EPROM type of semiconductor chip or an ASICS type of semiconductor chip, and including the chip in a digital computer system.

Incentive offers may be generated via email, postal mail, target market advertising, POS terminal printing, and the like.

Our invention is not limited to the specific example above. It is more properly defined by the scope of protection we claim in the following claims. 

What is claimed is:
 1. A computer implemented method of predicting demographic traits of consumers based on purchase histories and identifying the predicted demographic traits that are most likely to be accurate, the method being implemented by a computer system having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the computer system to perform the method, the method comprising: obtaining, by the computer system, a first set of consumer records related to a plurality of first consumers, wherein each of the first set of consumer records includes: a first set of known demographic traits for a corresponding first consumer, and first purchase information that indicates one or more first purchases made by the corresponding first consumer; obtaining, by the computer system, second purchase information that indicates one or more second purchases made by a second consumer; comparing, by the computer system, the first purchase information and the second purchase information to determine one or more correlations between the first purchase information and the second purchase information; determining, by the computer system, a set of assumed demographic traits of the second consumer based on the comparison and the first set of known demographic traits, wherein each assumed demographic trait: (i) predicts a demographic trait of the second consumer, and (ii) is associated with a probability that the assumed demographic trait is accurate; selecting, by the computer system, a subset of the set of assumed demographic traits based on the their respective probabilities such that at least a first assumed demographic trait is selected over at least a second assumed demographic trait based on a first probability that the first assumed demographic trait is accurate and a second probability that the second assumed demographic trait is accurate; and identifying, by the computer system, an incentive to provide to the second consumer based on the subset.
 2. The method of claim 1, the method further comprising: providing the incentive to the second consumer through at least a first communication channel from among a plurality of communication channels.
 3. The method of claim 2, the method further comprising: determining a redemption probability that indicates a likelihood of a redemption of the incentive by the second consumer if the incentive is provided through the first communication channel; and determining whether to provide the incentive through the first communication channel based on the redemption probability.
 4. The method of claim 1, wherein the first assumed demographic trait relates to a first value for a given type of assumed demographic trait and the second assumed demographic trait relates to a second value for the given type of assumed demographic trait.
 5. The method of claim 1, wherein the first assumed demographic trait relates to a first type of assumed demographic trait and the second assumed demographic trait relates to a second type of assumed demographic trait.
 6. The method of claim 1, wherein selecting the subset comprises: filtering the set of assumed demographic traits based on: (i) respective values of each of the set of assumed demographic traits, (ii) respective types of each of the set of assumed demographic traits, and (iii) the respective probabilities that each of the set of assumed demographic traits is accurate.
 7. The method of claim 6, wherein filtering the set of assumed demographic traits comprises: filtering at least a first type of assumed demographic trait based on a first probability threshold and filtering at least a second type of assumed demographic trait based on a second probability threshold different from the first probability threshold.
 8. The method of claim 1, wherein a given probability that a given assumed demographic trait is accurate is based on a level of correlation between the first purchase information and the second purchase information.
 9. The method of claim 1, wherein a given probability that a given assumed demographic trait is accurate is based on a level of reliability of correlations between the first purchase information and the second purchase information in predicting the given assumed demographic trait.
 10. The method of claim 1, wherein selecting the subset comprises: ranking the set of assumed demographic traits with respect to each other based on the respective probabilities, such that the first demographic trait is ranked with respect to the second demographic trait, wherein the selected subset is based on the ranking.
 11. The method of claim 1, wherein the second consumer is associated with an assumed or known demographic trait that indicates a consumer location related to the second consumer, the method further comprising: determining a retail location associated with a retailer; and correlating the consumer location with the retail location to determine whether the consumer location is within a certain region of the retail location, wherein the incentive is identified based further on the correlation.
 12. The method of claim 11, the method further comprising: determining a retail location associated with a retailer; and performing demographic analysis based on the retail location and at least some of the set of assumed demographic traits.
 13. The method of claim 12, wherein the second consumer comprises a plurality of second consumers, and wherein the subset of the set of assumed demographic traits relates to a respective consumer location for each of the plurality of second consumers, wherein performing demographic analysis comprises: associating a set of the plurality of second consumers that made a purchase at the retailer with one or more regions within a certain proximity to the retail location based on the second purchase information and the respective consumer location for each of the plurality of second consumers; and determining a number of consumers of the retailer in the one or more regions based on the association between the set of the plurality of second consumers and the one or more regions.
 14. The method of claim 13, the method further comprising: determining a total number of consumers that made a purchase at the retailer based on the second purchase information; and calculating a fraction of consumers in the one or more regions relative to the total number of consumers based on the number and the total number; and determining whether to provide the incentive based on the fraction.
 15. The method of claim 12, wherein the second consumer comprises a plurality of second consumers, and wherein the subset of the set of assumed demographic traits relate to a respective consumer location for each of the plurality of second consumers, wherein performing demographic analysis comprises: associating a set of the plurality of second consumers that made a purchase at the retailer with one or more regions within a certain proximity to the retail location based on the second purchase information and the respective consumer location for each of the plurality of second consumers; and determining a value of purchases that the set of the plurality of second consumers in the one or more regions made at the retail location based on the association between the set of the plurality of second consumers and the one or more regions.
 16. The method of claim 15, the method further comprising: determining a total value of purchases made at the retailer based on the second purchase information; and calculating a fraction of the value of purchases made in the one or more regions relative to the total value based on the value and the total value; and determining whether to provide the incentive based on the fraction.
 17. A system of predicting demographic traits of consumers based on purchase histories and identifying the predicted demographic traits that are most likely to be accurate, the system comprising: a computer system comprising one or more physical processors programmed with computer program instructions to: obtain a first set of consumer records related to a plurality of first consumers, wherein each of the first set of consumer records includes: a first set of known demographic traits for a corresponding first consumer, and first purchase information that indicates one or more first purchases made by the corresponding first consumer; obtain second purchase information that indicates one or more second purchases made by a second consumer; compare the first purchase information and the second purchase information to determine one or more correlations between the first purchase information and the second purchase information; determine a set of assumed demographic traits of the second consumer based on the comparison and the first set of known demographic traits, wherein each assumed demographic trait: (i) predicts a demographic trait of the second consumer, and (ii) is associated with a probability that the assumed demographic trait is accurate; select a subset of the set of assumed demographic traits based on the their respective probabilities such that at least a first assumed demographic trait is selected over at least a second assumed demographic trait based on a first probability that the first assumed demographic trait is accurate and a second probability that the second assumed demographic trait is accurate; and identify an incentive to provide to the second consumer based on the subset.
 18. The system of claim 17, wherein the computer system is further programmed to: provide the incentive to the second consumer through at least a first communication channel from among a plurality of communication channels.
 19. The system of claim 18, wherein the computer system is further programmed to: determine a redemption probability that indicates a likelihood of a redemption of the incentive by the second consumer if the incentive is provided through the first communication channel; and determine whether to provide the incentive through the first communication channel based on the redemption probability.
 20. The system of claim 17, wherein the first assumed demographic trait relates to a first value for a given type of assumed demographic trait and the second assumed demographic trait relates to a second value for the given type of assumed demographic trait.
 21. The system of claim 17, wherein the first assumed demographic trait relates to a first type of assumed demographic trait and the second assumed demographic trait relates to a second type of assumed demographic trait.
 22. The system of claim 1, wherein the computer system is further programmed to: filter the set of assumed demographic traits based on: (i) respective values of each of the set of assumed demographic traits, (ii) respective types of each of the set of assumed demographic traits, and (iii) the respective probabilities that each of the set of assumed demographic traits is accurate, wherein the subset is selected based on the filtered set of assumed demographic traits.
 23. The system of claim 22, wherein the computer system is further programmed to: filter at least a first type of assumed demographic trait based on a first probability threshold and filtering at least a second type of assumed demographic trait based on a second probability threshold different from the first probability threshold.
 24. The system of claim 17, wherein a given probability that a given assumed demographic trait is accurate is based on a level of correlation between the first purchase information and the second purchase information.
 25. The system of claim 17, wherein a given probability that a given assumed demographic trait is accurate is based on a level of reliability of correlations between the first purchase information and the second purchase information in predicting the given assumed demographic trait.
 26. The system of claim 17, wherein the computer system is further programmed to: rank the set of assumed demographic traits with respect to each other based on the respective probabilities, such that the first demographic trait is ranked with respect to the second demographic trait, wherein the selected subset is based on the rank.
 27. The system of claim 17, wherein the second consumer is associated with an assumed or known demographic trait that indicates a consumer location related to the second consumer, wherein the computer system is further programmed to: determine a retail location associated with a retailer; and correlate the consumer location with the retail location to determine whether the consumer location is within a certain region of the retail location, wherein the incentive is identified based further on the correlation.
 28. The system of claim 27, wherein the computer system is further programmed to: determine a retail location associated with a retailer; and perform demographic analysis based on the retail location and at least some of the set of assumed demographic traits.
 29. The system of claim 28, wherein the second consumer comprises a plurality of second consumers, and wherein the subset of the set of assumed demographic traits relates to a respective consumer location for each of the plurality of second consumers, wherein the computer system is further programmed to: associate a set of the plurality of second consumers that made a purchase at the retailer with one or more regions within a certain proximity to the retail location based on the second purchase information and the respective consumer location for each of the plurality of second consumers; and determine a number of consumers of the retailer in the one or more regions based on the association between the set of the plurality of second consumers and the one or more regions.
 30. The system of claim 29, wherein the computer system is further programmed to: determine a total number of consumers that made a purchase at the retailer based on the second purchase information; and calculate a fraction of consumers in the one or more regions relative to the total number of consumers based on the number and the total number; and determine whether to provide the incentive based on the fraction.
 31. The system of claim 28, wherein the second consumer comprises a plurality of second consumers, and wherein the subset of the set of assumed demographic traits relate to a respective consumer location for each of the plurality of second consumers, wherein the computer system is further programmed to: associate a set of the plurality of second consumers that made a purchase at the retailer with one or more regions within a certain proximity to the retail location based on the second purchase information and the respective consumer location for each of the plurality of second consumers; and determine a value of purchases that the set of the plurality of second consumers in the one or more regions made at the retail location based on the association between the set of the plurality of second consumers and the one or more regions.
 32. The system of claim 31, wherein the computer system is further programmed to: determine a total value of purchases made at the retailer based on the second purchase information; and calculate a fraction of the value of purchases made in the one or more regions relative to the total value based on the value and the total value; and determine whether to provide the incentive based on the fraction. 